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(54) Method and apparatus for processing a pixel map 



(57) An improved technique for processing a color 
or gray scale pixel map representing a document is dis- 
closed. The pixel map is decomposed into a three-plane 
representation, a reduced-resolution "upper" plane (12), 
a reduced-resolution "lower" plane (14), and a high-res- 
olution binary selector plane (16). The "upper" and "low- 
er" planes contain the color or gray scale for the page 
as well as the continuous tone pictures that are con- 



tained on the page. The selector plane (16) stores infor- 
mation for selecting from either the foreground plane or 
background plane during decompression. Information 
contained in the selector plane is first used to pre-proc- 
ess the upper and lower planes to reduce the amount 
of data on each of the other two planes that will be sub- 
jected to further processing. Each of the pre-processed 
planes is compressed using a compression technique 
optimal for the type of data that resides upon it. 
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Description 

[0001] This invention relates generally to image 
processing and, more particularly, to techniques for 
processing, e.g. compressing, a pixel map. s 
[0002] Data contained in documents that have been 
scanned at high resolutions requires very large amounts 
of storage space. This data is typically subjected to 
some form of data compression in order to avoid the 
high costs that would be associated with storing it. io 
"Lossless" compression methods such as Lempel-Ziv 
Welch (LZW) do not perform particularly well on portions 
of the document that are scanned pixel maps; "lossy" 
methods such as JPEG work fairly well on continuous- 
tone pixel maps, but they do not work particularly well 1$ 
on the parts of the document that contain text. To opti- 
mize image data compression, techniques, which can 
recognize the type of data being compressed, are need- 
ed. 

[0003] Examples of known compression techniques 20 
are described in US-A-5778092, 5251271, 5060980, 
5784175 and 5303313. 

[0004] In one embodiment of the invention, a method 
of compressing a pixel map representation of a docu- 
ment is disclosed, the method including the steps of ere- 25 
ating a first image plane from the pixel map representa- 
tion of a document, the first image plane containing color 
signals for a first subset of image data in the document; 
creating a second image plane from the pixel map rep- 
resentation of the document, the second image plane 30 
containing image signals for a second subset of image 
data in the document: creating a selector plane from the 
pixel map representation of the document, wherein the 
selector plane selects between the first subset signals 
and the second subset signals when decompressing the 35 
compressed pixel map representation of the document; 
and pre-processing the first image plane and the second 
image plane based upon data contained in the selector 
map. 

[0005] In another embodiment of the invention an ap- 40 
paratus for processing pixel map representations of 
documents is disclosed, including: means for creating a 
first image plane from the pixel map representation of a 
document, the first image plane containing color signals 
for a first subset of image data in the document; means 4 & 
for creating a second image plane from the pixel map 
representation of the document, the second image 
plane containing image signals for a second subset of 
image data in the document; means for creating a se- 
lector plane from the pixel map representation of the so 
document, wherein the selector plane selects between 
the first subset signals and the second subset signals 
when decompressing the compressed pixel map repre- 
sentation of the document; and means for pre-process- 
ing the first image plane and the second image plane ss 
based upon data contained in the selector map. 
[0006] In the preferred embodiment of the invention, 
a program storage device tangibly embodying a pro- 
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gram of instructions executable by a machine to perform 
method steps for compressing a document image is dis- 
closed, wherein the method steps include: creating a 
first image plane from the pixel map representation of a 
document, the first image plane containing color signals 
for a first subset of image data in the document; creating 
a second image plane from the pixel map representation 
of the document, the second image plane containing im- 
age signals for a second subset of image data in the 
document; creating a selector plane from the pixel map 
representation of the document, wherein the selector 
plane selects between the first subset signals and the 
second subset signals when decompressing the com- 
pressed pixel map representation of the document: and 
pre-processing the first image plane and the second im- 
age plane based upon data contained in the selector 
map. 

[0007] Some examples of methods and apparatus ac- 
cording to the invention will now be described with ref- 
erence to the accompanying drawings, in which> 

Figure 1 illustrates a composite image and includes 
an example of how such an image may be decom- 
posed into three MRC image planes, an upper 
plane, a lower plane, and a selector plane; 
Figure 2 contains a flowchart illustrating the basic 
steps for compressing a document according to the 
present invention; 

Figure 3 shows a detailed example of the typical 
contents of a selector plane for an 8 x 6 block of 
pixels; 

Figure 4 shows a detailed example of an image 
plane map which corresponds to the selector map 
of Figure 3; 

Figure 5 depicts one embodiment of the present in- 
vention for pre-processing image planes: 
Figure 6 illustrates another embodiment of the 
present invention for pre-processing image planes; 
Figure 7 shows the manner in which a near non- 
destructive embodiment of the present invention 
may be used in conjunction with a JPEG compres- 
sion system to pre-process image planes for sub- 
sequent JPEG compression; 

Figure 8 contains a detailed illustration of an itera- 
tive smoothing technique that may be used in con- 
junction with the present invention: 
Figure 9 illustrates an alternative smoothing tech- 
nique; 

Figure 10 illustrates a typical block: and, 

Figure 11 illustrates a typical device in which the 

present invention may be implemented. 

[0008] The present invention is directed to a method 
and apparatus for separately processing the various 
types of data contained in a composite image. While the 
invention is described in conjunction with a Mixed 
Raster Content (MRC) representation technique, those 
skilled in the art will recognize that it may be adapted 
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for use with other methods and apparatus' and the in- 
vention is therefore; not limited to this description. The 
technique described herein is suitable for use in various 
devices required to store or transmit color or grayscale 
documents such as facsimile devices, image storage 
devices and the like. It should be noted that the exam- 
ples and illustrations presented in the figures are in gray 
scale : but the same concepts apply to color documents 
and conversely : those portions of the invention that are 
described with reference to color documents apply 
equally to gray scale documents. 
[0009] A pixel map is one in which each pixel repre- 
sents some "value 0 which indicates the color or, in the 
case of gray scale document, how light or dark the im- 
age is at that point. As those skilled in the art will appre- 
ciate, most pixel maps have values that are taken from 
a set of discrete, non-negative integers. For example, a 
typical gray-scale pixel map might have values ranging 
from 0 : representing black, to 255, representing the 
whitest tone possible. The pixel maps of concern in the 
currently preferred embodiment are representations of 
"scanned" images. That is, images which are created 
by digitizing light reflected off of physical media using a 
digital scanner. The term bitmap is used to mean a bi- 
nary pixel map in which pixels can take one of two val- 
ues, 1 or 0. An example of a device that may be used 
to obtain such scanned images is illustrated in Figure 
11 which illustrates a processing system 1701 including 
a microprocessor 1702, a user interface 1704 and a 
memory module 1705. The system 1701 is coupled to 
a scanner 1703 and a printer 1706. 
[0010] Turning now to the drawings for a general de- 
scription of the invention, as indicated in Figure 1, pixel 
map 10 representing a color or gray-scale document is 
preferably decomposed into a three plane page format. 
The document format is comprised of an upper plane 
12, a lower plane. 14, and a selector plane 16. Upper 
plane 12 and lower plane 14 are typically stored at the 
same bit depth and number of colors as the original pixel 
map 10, but usually at reduced resolution. However as 
those skilled in the art will appreciate, the processing of 
planes can include a reduction in the bit depth or a pal- 
ette color encoding. It is important to recognize that 
while the terms "upper 0 and "lower" are used to describe 
the planes on which data resides, it is not intended to 
limit the invention to any particular arrangement. Fur- 
ther, it is also possible to practice the invention with 
planes that are composed of multiple superimposed 
separations. If this is the case, it is possible to apply the 
present invention to all separations together or to each 
color separation individually. 

[0011] Processing typically occurs on a block by block 
basis, rather than by simultaneously processing all the 
image data. For example, if JPEG compression will be 
applied, 8x8 blocks must be provided. That is, the im- 
age data must be separated into groups of 64 pixels, 
with 8 pixels extending in the horizontal direction and 8 
blocks extending in the vertical direction. JPEG is mere- 



ly an example of one compression format that may be 
used with the present invention. The blocks may be or- 
ganized in another configuration if required by the tech- 
nique that will be used. After all blocks are processed, 

s any or all three planes may be compressed using a 
method suitable for the type data residing thereon. Con- 
tinuing with the example already provided, upper plane 
12 and lower plane 14 may be compressed and stored 
using JPEG, while selector plane 16 is compressed us- 

io ing a symbol-based compression format. It would be ap- 
parent to one of skill in the art to compress and store the 
planes using other formats that are suitable for the in- 
tended use of the document. For example, in the Color 
Facsimile arena, group 4 (MMR) would preferably used 

is for the selector plane, since the particular compression 
format used must be one of the approved formats 
(MMR, MR, MH, JPEG, JBIG, etc.) for facsimile data 
transmission. 

[0012] Lower plane 14 commonly contains both infor- 

20 mation that is pertinent to the background color of the 
page (including the color of tints, washes, etc.) and the 
continuous-tone pictures that are found on the page. 
Upper plane 12 commonly contains the "ink colors" of 
foreground items such as text. Selector plane 16 is typ- 

25 ically stored at higher resolution than the upper and low- 
er planes. Selector plane 1 6 is used to describe, for each 
pixel in the selector plane, whether to use the pixel value 
found in the lower plane or the upper plane during image 
reconstruction. If a "white" pixel in the selector plane (i. 

30 e. a logical zero value) means the pixel value should be 
taken from the corresponding pixel from the lower plane, 
a "black" pixel in the selector plane (i.e. a logical one 
value) means that the pixel value should be taken from 
the corresponding pixel from the upper plane. 

35 [0013] Figure 2 contains a flowchart depicting the ba- 
sic steps for compressing a document using an embod- 
iment of the present invention. Blocks from an original 
pixel map 10 - a pixel map representation of the original 
document to be compressed — are first obtained as in- 

40 dicated in step 102. This may be through scanning an 
original, by retrieving a stored pixel map representation 
of the document, or by converting an electronic or page 
description language representation of an original doc- 
ument into a pixel map representation. Pixel map 10 rep- 

45 resentation is then analyzed to generate the information 
for the three planes as indicated in steps 104-108. Se- 
lector plane 16 is implicitly or explicitly computed first, 
as indicated in step 104 and is used to create the other 
planes. Those skilled in the art will recognize that use 

50 of the phrase "implicitly or explicitly" refers to the fact 
that the invention does not require actual calculation and 
generation of selector plane 16. While selector plane 16 
can be generated, the invention may be accomplished 
by simply moving pixels from one plane to another, and 

55 marking the pixels that have been moved. Technically, 
this calculates one plane such as lower plane 14 first, 
but simultaneously it implicitly calculates selector plane 
16. 
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[001 4] Selector plane 1 6 is typically a bitmap comput- 
ed using a technique suitable lor finding text or the like 
on original pixel map 10. What results is a bitmap where 
pixels have a 1 value where they represent text and a 0 
elsewhere. It should be noted that the term "text" refers 
to page objects that have text properties, such as sharp, 
high contrast edges, etc., including many other objects 
that do not qualify as "readable" text. Pixels are placed 
on either upper plane 1 2 or lower plane 14 according to 
the data on selector plane 16. 

[0015] An upper plane 12, typically stored at a re- 
duced resolution relative to original pixel map 10, con- 
tains color (or gray scale) information of upper items 
such as text is computed using selector plane as indi- 
cated in step 106. Briefly, creating upper plane 12 in- 
volves creating an image containing the color of the ob- 
jects (pixels) selected in the selector plane. Conceptu- 
ally, the method can be viewed as pouring ink contents 
of the upper plane through a mask located on the selec- 
tor plane onto the background of the lower plane. The 
ink colors are placed in a reduced-resolution "ink map" 
that will ultimately become upper plane 12. Without the 
present invention, the empty values are typically filled 
in with pre-computed ink colors. 

[0016] A lower plane 1 4, also typically stored at a low- 
er resolution than original pixel map 10, is then comput- 
ed as indicated in step 108. In this step, one embodi- 
ment of the invention includes an image segmentation 
process that identifies the "image" or non-text portions. 
This information is used to create the reduced resolution 
lower map, which contains background color informa- 
tion as well as continuous tone image information. The 
result is an image that has ail small, text-like features 
deleted, but which includes tints as well as color or gray 
scale data. 

[0017] Once the three planes have been generated, 
either or all of them may be compressed at steps 
110-114 using a technique suitable for compressing the 
type of data that lies thereon. The compressed data rep- 
resenting each plane can be recombined at step 116, 
after the necessary compression has taken place, in or- 
der to create a single representation of the data, for stor- 
age in a computer file, or transmission in a single chan- 
nel. If case multipie transmission channels are available 
step 116 may not be necessary. 

[0018] The present invention includes a method and 
apparatus which pre-processes the data on upper plane 
12 and lower plane 14 using the information contained 
on selector plane 16. Turning now to Figure 3, as stated 
earlier selector plane 16 includes a pattern of zeros and 
ones, dispersed in an 8 x 8 block. An 8 x 8 block such 
as that illustrated here corresponds to an 8 x 8 block of 
data that is provided by the compressor which, in the 
preferred embodiment of the invention, will be a JPEG 
compressor. If a compression technique that provides 
data in another configuration is used, selector plane 16 
will have the zeros and ones placed thereon, dispersed 
in a corresponding pattern. As stated earlier, it is as- 
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sumed here that a 0 on selector plane 16 means that 
the pixel value should be taken from the corresponding 
pixel from the lower plane 14, while a 1 on the selector 
plane means that the pixel value should be taken from 

s the corresponding pixel from upper plane 12. 

[0019] In the preferred embodiment of the present in- 
vention, when processing image planes that will be re- 
duced for compression, the block size used in the pre- 
processing step may be enlarged to compensate for the 

10 reduction in image size, so that the final processed block 
size matches the block size used for compressing the 
image plane. 

[0020] Referring now to Figure 4, image plane maps 
that identify the pixels in each block that will be used to 
15 reconstruct the final output image from the two planes 
are next created. For lower plane 14 map 304, is created 
wherein an "N" is placed in every location in which a 1 
was located on selector plane 16 to mark the pixels in 
the block that will not be used during image reconstruc- 
ts tion. A "Y" is placed in those locations in which 0's were 
located on selector plane 16 to show the pixels in the 
block that are to be retained for the output image. Sim- 
ilarly, for upper plane 12, map 302 is created and NTs 
are placed in those locations which correspond to 0's on 
25 selector plane 16 : while Y's are placed in the locations 
that correspond to Vs. Those skilled in the art will rec- 
ognize that the second map generated may be created 
by simply inverting the first map. 

[0021] In one embodiment of the invention, referred 

30 to as non-destructive processing, retained ("Y" labeled) 
pixels are never modified. As indicated in Figure 5, the 
first step 402 is to determine the number of locations in 
the block in the image plane map 302 or 304 that have 
been identified as disposable ("N" pixels). For simplicity, 

35 the invention will continue to be described with refer- 
ence to a block in the lower plane 14. As shown in step 
406, if no locations in image plane map 304 have been 
identified as N locations, the block is simply output as 
is. Note that the average "A" ot the block is implicitly or 

40 explicitly computed before it is output in step 406. Those 
skilled in the art will recognize that average "A" could be 
obtained by re-using the DC term of the JPEG compres- 
sion, and that while an explicit calculation may occur, it 
is not necessary. On the other hand, if all locations in 

45 image plane map 304 have been identified as N loca- 
tions, all of the pixels in the block that lie on lower plane 
14 can be set to a constant value. In one embodiment 
of the invention, the constant value is set equal to the 
average of all pixels values in the previously processed 

50 block, i.e. set to "A". Those skilled in the art will recog- 
nize that numerous methods can be used to calculate 
the most appropriate constant value, and that the inven- 
tion is not limited to using this average. Lower plane 14 
with its newly assigned values is then output at step 406 

55 [0022] With continued reference to Figure 5, if neither 
all nor none of the pixels on image plane map 304 in the 
block being processed have been identified as N pixels 
(i.e. the number of N identified pixels is not equal to ei- 
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ther zero or the maximum value which, in the case of 
JPEG compression would be 64) all of the pixels in the 
block that correspond to Y locations on image plane 
map 304 are replaced with values that will enhance the 
compression of the block. Specifically, values placed on 
lower plane 14 will be those that will minimize the 
amount of data that will be generated during image com- 
pression. In the preferred embodiment of the invention, 
these values will be provided using an iterative image 
smoothing technique, which will be described in detail 
later (see Figure 8 and corresponding discussion). Low- 
er plane 14 with its newly updated values is then output 
at step 406. 

[0023] It is important to understand that even in the 
non -destructive case, artifacts can occur during decom- 
pression that are caused by pixel values at N locations. 
Assume for simplicity that all Y pixels have a value of 
200. Filling all N pixels with value 55 will produce a ring- 
ing artifact that protrudes into the area of Y-marked pix- 
els. It is therefor necessary and one intention of this in- 
vention, to use values for the N-marked pixels that op- 
timize compression while not introducing artifacts in the 
Y-marked regions on decompression. 
[0024] Turning now to Figure 6 : in another embodi- 
ment of the invention a "near non-destructive process- 
rig" technique is used to process image data. The 
o^rase "near non-destructive is used to indicate that 
some of the Y labeled pixels in the block are likely to be 
sijqmiy modified using this approach. Near nondestruc- 
tive processing is generally accomplished by determin- 
ing now much variance there is between the Y labeled 
pixels on the image plane and then comparing that var- 
iance to some pre-determined threshold value. If the 
variance of the Y labeled pixels is small enough, 
processing time can be reduced by replacing the entire 
image data block with a block of pixels that has a uniform 
value Thus, the embodiment first requires inputting 
threshold and computing the variance of the block. The 
threshold value indicates the maximum amount of dis- 
tortion that is acceptable for decompression. The vari- 
ance indicates the activity of the block - whether there 
are large variations in the type (i.e. text, pictorial) of im- 
age data within the given block. 

[0025] As before, the process begins by determining 
the number of locations in the image plane map 304 for 
which the block has N identified pixels, as indicated in 
step 402. If no locations in the block have been identified 
as N pixels, the block is output in step 406 and as before, 
if all locations in the block have been identified as N pix- 
els in image plane map 304, all of the pixels in the block 
are replaced at step 41 0 with pixels that have a constant 
value such as the average value for pixels in a previous- 
ly processed block, or some other appropriate value. 
[0026] Near non-destructive processing may be ap- 
plied if image plane map 304 has neither all N marked 
pixels or all Y marked pixels. If this is the case, process- 
ing of the image is dependent upon the relationship be- 
tween the variance and the threshold described above 



and illustrated in step 41 4. If the variance is greater than 
or equal to the threshold, all pixels in lower plane 1 4 that 
are in locations which correspond to those identified with 
N's on image plane map 304 are again replaced with 
s values that will enhance compression. As before, the 
preferred embodiment of the invention includes calcu- 
lating these values using an iterative image smoothing 
technique such as the one described below. The block 
will then be output at step 406. 

10 [0027] If the variance is less than the threshold, the 
entire block will be replaced by a uniform block with pix- 
els that have a constant value as indicated in step 416. 
It should be noted here that the constant value used at 
step 416 will typically not be the same as that which 

is would have been used if all of the pixels had been 
marked with N (step 410). While minimizing the amount 
of data that will be generated during image decompres- 
sion is still the goal in this step, a different averaging 
technique will often be required to accomplish that task. 

20 in the preferred embodiment of the invention, the aver- 
age of the pixels corresponding to locations marked with 
Y's will be calculated and that value will be the constant 
used in step 416. Again, those skilled in the art will rec- 
ognize that numerous methods may be used to calcu- 

25 late the most appropriate constant value and the inven- 
tion is not limited to this embodiment. 
[0028] As explained earlier, in the preferred embodi- 
ment the present invention will be implemented using 
JPEG compression to compress upper plane 12 and 

30 lower plane 14. Thus, a simplifying method can be ap- 
plied by incorporating the present invention within the 
JPEG compression module. An implementation of this 
embodiment is provided in Figure 7. As stated above, 
the process begins by determining the number of N lo- 
ss cations in the image plane map 304 for the block at step 
402. If there are no N locations in the block, the block is 
still output as before, the difference here being that, as 
shown in step 606, output 620 is preceded by JPEG en- 
coding. Thus the output module 620 actually outputs a 

40 variable amount of bits generated by the JPEG com- 
pression process. 

[0029] If all locations in image plane map 304 for the 
block have been identified as N pixels, the JPEG com- 
pliant bitstring relative to the 0 DC difference is output, 

45 followed by an end of block symbol. When decoding the 
image data, those two symbols indicate that the current 
block is uniform and has the same average as the pre- 
viously coded block. This bitstring will be 01 01 1 0 for de- 
fault luminance tables as indicated in step 610 and is 

so perhaps the shortest possible valid string to represent 
a block in JPEG. The resulting data block plane will 
again be output at step 620. The motivation for using 
the average of the previous block as opposed to the cur- 
rent one is now clear since by using this method, the 

55 amount of JPEG compressed data for the block being 
processed will be minimal. 

[0030] Still referring to Figure 7, assuming the near 
non-destructive processing method is being used, the 
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next step occurs when neither all nor none of the pixels 
on image plane map 304 has been identified as N pixels. 
What takes place during this next step again depends 
upon the relationship between the variance and the 
threshold described above. Looking first at step 414 if 
the variance is greater than or equal to the threshold, all 
pixels in the plane in locations which correspond to 
those identified with Y's on image plane map 304 are 
replaced with values that will enhance decompression 
(i.e. minimize the amount of generated data). Again, in 
the preferred embodiment of the invention, this will be 
an iterative smoothing technique. The "smoothed" block 
is then compressed using JPEG at step 612, and the 
compressed data bits are output at step 620. 
[0031] If the variance is less than the threshold, the 
plane will be replaced by a uniform block of pixels at 
step 608. In one embodiment of the invention, the value 
of the uniform block will be equal to the average of pixels 
in the block that have been marked with Ys in image 
plane map 304. Output using simplified JPEG encoding 
will take place at step 616. Use of the term "simplified" 
JPEG encoding means that the block average is used 
as the DC value of the discrete cosine transform (DCT) 
which is the only DCT value to be encoded and output. 
Therefore, the DCT computation and the quantization 
or encoding of DCT AC values for the block do not take 
place. 

[0032] Referring now to Figure 8 : the details of one 
embodiment of an iterative image smoothing technique, 
step 412 in Figures 5, 6 and 7 will now be described. As 
indicated above, one way to enhance compression is to 
replace the N pixel values in the block with values that 
will compress better, since those values will not be used 
during reconstruction anyway. 

[0033] The fact that iterative smoothing is being ap- 
plied means that there are initially at least some N's on 
the map. Those N locations that have at least one ver- 
tical or horizontal Y neighbor are noted. It should be 
pointed out that diagonal neighbors are not counted du r- 
ing this part of the process. Next, the values of all pixels 
in the block that correspond to the selected N locations 
will be replaced by the average of all of their neighboring 
pixels that correspond to previously identified Y loca- 
tions as indicated in step 706. Diagonal as well as ver- 
tical and horizontal neighbors may be included in this 
averaging. The replaced pixels are identified with Y's in 
corresponding locations in image plane map block 304 
as indicated in step 708. The status of map 304 is 
checked at step 702 to determine whether all locations 
in image plane map 304 that correspond to locations in 
the block are identified with Y's. The process is repeated 
until this is the case (i.e. until there are no more N's on 
image map 304 in locations that correspond to those in 
the block). Once no more N locations the block is output 
as indicated in step 406. 

[0034] Turning now to Figure 9, another way to per- 
form iterative smoothing for step 412 is to use a discrete 
cosine transform. As before, N locations that have at 



least one vertical or horizontal Y neighbor replaced by 
the average of all of pixels that correspond to Y locations 
as indicated in step 802. A discrete cosine transforma- 
tion (DCT) is then applied to this new block, and the re- 
5 suits of the transformation are quantized as indicated in 
step 804. 

[0035] In one embodiment of the invention, some of 
the high frequency coefficients are removed at step 806. 
It is anticipated that several iterations will occur before 

to this process has been completed. How many levels of 
high frequency coefficients that are removed will de- 
pend upon how many iterations have occurred, with the 
number of levels removed in direct proportion to the 
number of iterations. 

is [0036] Turning for a moment to Figure 10, an 8 x 8 
block which describes frequency "levels" is provided. As 
shown, the 0 level coefficient is the DCC. The first level 
coefficients are marked by 1 's, second level coefficients 
marked by 2's, third level marked by 3's etc., until coef- 

20 ficients in all 14 levels are identified. Thus, when it is 
desired to use only the first level coefficients, three pixel 
values will be provided - those values marked with num- 
bers less than or equal to 1 . If the fourth level coefficients 
are to be used, fifteen pixel values, those marked with 

25 numbers less than or equal to 4, will be used. 

[0037] Turning back to Figure 9, in another embodi- 
ment of the invention, step 806 is skipped, and all coef- 
ficients produced by the DCT are used for subsequent 
processing. 

30 [0038] Next, the plane is subjected to inverse quanti- 
zation and inverse DCT to produce a pseudo-plane as 
indicated in step 808. As indicated in step 810, the pixel 
values in the original plane that correspond to N loca- 
tions in image plane map 304 are then replaced with 

35 pixels in the same locations in the pseudo-plane. As in- 
dicated earlier, this is an iterative process and it is re- 
peated until a designated criteria is met, as shown in 
step 812. 

[0039] In one embodiment of the invention, the proc- 
40 ess is repeated for a fixed number of iterations. An ex- 
ample of this embodiment is to perform processing only 
once (stop when K=2) and the results of that single iter- 
ation can be used. In another embodiment, processing 
stops after a comparison of either Y or N identified pixels 
45 in consecutive iterations takes place, and it is deter- 
mined that a designated amount of improvement or 
change has occurred. 

[0040] Turning again to Figure 2, once each of the re- 
spective planes is generated, they are each com- 

50 pressed using a suitable compression technique, step 
1 1 2. In the currently preferred embodiment, upper plane 
12 and lower plane 14 are compressed using JPEG 
while the selector plane 16 is compressed using a sym- 
bol based pattern matching technique such as CCITT 

55 Group 4 or a method of classifying scanned symbols into 
equivalence classes such as that described in US-A 
5,778,095 to Davies issued July 7, 1998, the contents 
of which are hereby incorporated by reference. 
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[0041] While this invention has been described in 
terms of compressing a pixel map that is represented 
as a selector plane and two image planes, those skilled 
in the art will recognize that it can be adapted to com- 
press a pixel map that is represented as a selector plane 
and a single image plane. A pixel map representation 
such as this may include an image with an associated 
mask, where the mask is used to select an irregularly 
shaped area from the image. In a representation such 
as that described, the image pixels not selected by the 
mask correspond to N locations in the image plane and 
can be processed by any ot the methods described in 
the present invention to increase the compression ratio 
of the single image plane and improve the quality of the 
decompressed image. 

[0042] In summary ; the present invention uses the se- 
lector plane to replace, for each plane, pixels that have 
been designated to be provided by the other plane by 
carefully chosen values. The previously existing data is 
completely ignored, and the newly chosen values are 
calculated for such that the number of bits that will be 
generated during the subsequent compression is mini- 
mized. 



Claims 

1 . A method of processing a pixel map representation 
of a document, comprising the steps of: 

a) obtaining (102) a set of image data signals 
which represent light intensity in a portion of a 
digital image: 

b) creating (104) a selector plane from said im- 
age data signals which separates said signals 
into subsets: 

c) creating (106) a first image plane using said 
selector plane and said image data signals, 
wherein said iirst image plane contains a first 
subset of image data signals; 

d) creating (106) a second image plane using 
said selector plane and said image data sig- 
nals, wherein said second image plane con- 
tains a second subset of image data signals; 
and 

e) processing (110-116) at least one of said first 
image plane and said second image plane 
based upon data contained in said selector 
plane. 

2. A method of processing a pixel map as claimed in 
claim 1, wherein said processing step further com- 
prises the steps of: 

a) generating an image plane map which iden- 
tifies locations in an associated image data 
subset in which image data signals that shall 
be used to reconstruct said digital image re- 



side: 

b) outputting said image plane map associated 
subset if said image plane map indicates that 
all image data signals in said subset shall be 

s used for said image reconstruction; 

c) replacing said image plane map associated 
subset with a subset of signals that have a con- 
stant value and outputting said constant signal 
value subset, if no locations on said image 

io plane map have been identified for said image 

reconstruction: and 

d) replacing all image data signals in said image 
plane map that correspond to associated sub- 
set locations not identified for image recon- 

is struction with image data signals which have 

values that will enhance said image reconstruc- 
tion and outputting said subset if at least one, 
but fewer than all locations have been identified 
for said image reconstruction. 

20 

3. A method of processing a pixel map as claimed in 
claim 1 , wherein said processing step further com- 
prises the steps of: 

25 a) inputting a threshold signal, which indicates 

an acceptable level of distortion for a subse- 
quent processing operation; 

b) inputting a variance signal, which indicates 
an average magnitude between an image data 

30 signal and an average value of all signals in 

said subset; 

c) generating an image plane map which iden- 
tifies locations in an associated image data 
subset in which image data signals that shall 

35 be used to reconstruct said digital image reside: 

and 

d) replacing all image data signals on said im- 
age plane map that correspond to associated 
subset locations not identified for image recon- 

40 struction with image data signals which have 

values that will enhance said image reconstruc- 
tion and outputting said subset if said variance 
signal is greater than or equal to said threshold 
signal and at least one, but fewer than all loca- 

45 tions in said image plane map associated with 

said subset have been identified for said image 
reconstruction. 

4. A method of processing a pixel map as claimed in 
50 claim 1 , wherein said processing step further com- 
prises the steps of: 

a) inputting a threshold signal, which indicates 
an acceptable level of distortion for a subse- 

55 quent processing operation; 

b) inputting a variance signal, which indicates 
an average magnitude between an image data 
signal and an average value of signals in said 
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subset; 

c) generating an image plane map which iden- 
tifies locations in an associated image data 
subset in which image data signals that shall 

be used to reconstruct said digital image re- s 
side; 

d) outputting an image plane associated with 
said subset if said image plane map indicates 
that all image data signals in said subset shall 

be used for said image reconstruction; io 

e) replacing said image data subset associated 
with said image plane map with a subset of sig- 
nals that have a constant value and outputting 
said constant signal subset, if no locations on 
said image plane map have been identified for is 
said image reconstruction; 

f) replacing all image data signals on said im- 
age plane map that correspond to associated 
subset locations not identified for image recon- 
struction with image data signals which have 20 
values that will enhance said image reconstruc- 
tion and outputting said image plane map as- 
sociated subset if said variance signal is great- 
er than or equal to said threshold signal and at 
least one, but fewer than all locations in said 25 
image plane map associated with said subset 
have been identified for said image reconstruc- 
tion; and 

g) replacing said image data subset with a 
plane of image data signals that have a uniform 30 
value and outputting said uniform signal subset 

if said variance signal is less than said thresh- 
old signal and at least one, but fewer than all 
locations on said image plane map have been 
identified for said image reconstruction. 35 

An apparatus for processing a pixel map represen- 
tation of a document, comprising: 

a) means for obtaining a set of image data sig- *o 
nais which represent light intensity in a portion 

of a digital image; 

b) means for creating a selector plane from said 
image data signals which separates said sig- 
nals into subsets; 45 

c) means for creating a first image plane using 
said selector plane and said image data sig- 
nals, wherein said first image plane contains a 
first subset of image data signals; 

d) means for creating a second image plane us- so 
ing said selector plane and said image data sig- 
nals: wherein said second image plane con- 
tains a second subset of image data signals; 
and 

e) means for processing said first image plane ss 
and said second image plane based upon data 
contained in said selector plane. 



6. An apparatus for processing a compressed pixel 
map as claimed in claim 5, wherein said processing 
means further comprises: 

a) means for generating an image plane map 
which identifies locations in an associated im- 
age data subset in which image data signals 
that shall be used to reconstruct said digital im- 
age reside: and 

b) means for replacing all image data signals 
that correspond to locations not identified for 
said image reconstruction with image data sig- 
nals which have values that will enhance said 
image reconstruction and outputting said asso- 
ciated image data subset if at least one, but 
fewer than all locations on said image plane 
map associated subset have been identified for 
said image reconstruction. 

7. An apparatus for processing a pixel map as claimed 
in claim 5 or claim 6, wherein said processing 
means further comprises: 

a) means for generating an image plane map 
which identifies locations in an associated im- 
age data subset in which image data signals 
that shall be used to reconstruct said digital im- 
age reside; 

b) means for outputting said image plane map 
associated subset if said image plane map in- 
dicates that all image data signals in said sub- 
set shall be used for said image reconstruction; 

c) means for replacing said image plane map 
associated subset with a subset of signals that 
have a constant value and outputting said con- 
stant signal value subset : if no locations on said 
image plane map have been identified for said 
image reconstruction; and 

d) means for replacing ail image data signals in 
said image plane map that correspond to asso- 
ciated subset locations not identified for image 
reconstruction with image data signals which 
have values that will enhance said image re- 
construction and outputting said subset if at 
least one, but fewer than all locations have 
been identified for said image reconstruction. 

A programmable storage device readable by a ma- 
chine, tangibly embodying a program of instructions 
executable by the machine to perform method steps 
for processing a pixel map, said method steps com- 
prising: 

a) obtaining a set of image data signals which 
represent light intensity in a portion of a digital 
image; 

b) creating a selector plane f rom said image da- 
ta signals which separates said signals into 
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subsets: 

c) . creating a first image plane using said selec- 
tor plane and said image data signals, wherein 
said first image plane contains a first subset of 
image data signals; 5 

d) creating a second image plane using said se- 
lector plane and said image data signals : 
wherein said second image plane contains a 
second subset of image data signals; and 

e) processing said first image plane and said 10 
second image plane based upon data con- 
tained in said selector plane. 
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(54) Method and apparatus for processing a pixel map 



(57) An improved technique for processing a color 
or gray scale pixel map representing a document is dis- 
closed. The pixel map is decomposed into a three-plane 
representation, a reduced-resolution "upper" plane (12), 
a reduced-resolution "lower" plane (14), and a high-res- 
olution binary selector plane (1 6). The "upper" and "low- 
er" planes contain the color or gray scale for the page 
as well as the continuous tone pictures that are con- 
tained on the page. The selector plane (16) stores infor- 
mation for selecting from either the foreground plane or 
background plane during decompression. Information 
contained in the selector plane is first used to pre-proc- 
ess the upper and lower planes to reduce the amount 
of data on each of the other two planes that will be sub- 
jected to further processing. Each of the pre-processed 
planes is compressed using a compression technique 
optimal for the type of data that resides upon it. 
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